A unifold, mesofold, and superfold model of protein fold use.
نویسندگان
چکیده
As more and more protein structures are determined, there is increasing interest in the question of how many different folds have been used in biology. The history of the rate of discovery of new folds and the distribution of sequence families among known folds provide a means of estimating the underlying distribution of fold use. Previous models exploiting these data have led to rather different conclusions on the total number of folds. We present a new model, based on the notion that the folds used in biology fall naturally into three classes: unifolds, that is, folds found only in a single narrow sequence family; mesofolds, found in an intermediate number of families; and the previously noted superfolds, found in many protein families. We show that this model fits the available data well and has predicted the development of SCOP over the past 2 years. The principle implications of the model are as follows: (1) The vast majority of folds will be found in only a single sequence family; (2) the total number of folds is at least 10,000; and (3) 80% of sequence families have one of about 400 folds, most of which are already known.
منابع مشابه
Accommodation of a highly symmetric core within a symmetric protein superfold.
An alternative core packing group, involving a set of five positions, has been introduced into human acidic FGF-1. This alternative group was designed so as to constrain the primary structure within the core region to the same threefold symmetry present in the tertiary structure of the protein fold (the beta-trefoil superfold). The alternative core is essentially indistinguishable from the WT c...
متن کاملIncreased Cytotoxicity of Cisplatin in SK-MEL 28 Melanoma Cells upon Down-Regulation of Melanoma Inhibitor of Apoptosis Protein
Background: Malignant melanoma is a highly metastatic cutaneous cancer and typically refractory to chemotherapy. Deregulated apoptosis has been identified as a major cause of cancer drug resistance, and upregulated expression of the inhibitor of apoptosis protein melanom, an inhibitor of apoptosis (ML-IAP) is frequent in melanoma. Methods: Based on the conclusion that ML-IAP expression contribu...
متن کاملMultiple structural alignment for distantly related all beta structures using TOPS pattern discovery and simulated annealing.
Topsalign is a method that will structurally align diverse protein structures, for example, structural alignment of protein superfolds. All proteins within a superfold share the same fold but often have very low sequence identity and different biological and biochemical functions. There is often significant structural diversity around the common scaffold of secondary structure elements of the f...
متن کاملMSAT: a multiple sequence alignment tool based on TOPS.
This article describes the development of a new method for multiple sequence alignment based on fold-level protein structure alignments, which provides an improvement in accuracy compared with the most commonly used sequence-only-based techniques. This method integrates the widely used, progressive multiple sequence alignment approach ClustalW with the Topology of Protein Structure (TOPS) topol...
متن کاملConsequences of domain insertion on sequence-structure divergence in a superfold.
Although the universe of protein structures is vast, these innumerable structures can be categorized into a finite number of folds. New functions commonly evolve by elaboration of existing scaffolds, for example, via domain insertions. Thus, understanding structural diversity of a protein fold evolving via domain insertions is a fundamental challenge. The haloalkanoic dehalogenase superfamily s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Proteins
دوره 46 1 شماره
صفحات -
تاریخ انتشار 2002